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RELATED APPLICATIONS 

This application is a divisional application of co-pending U.S. Patent Application 
Serial No. 09/364,528, filed on July 30, 1999 entitled "Methods for Routing Documents 
Based on a Measure of Criticality." This application is related to the cofiled, copending 

10 and coassigned applications entitled "Methods for Automatically Assigning Priorities to 
Documents and Messages" filed on July 30, 1999 and assigned Serial No. 09/364,527, 
"Integration of a Computer-Based Message Priority System with Mobile Electronic 
Devices" filed on July 30, 1999 and assigned Serial No. 09/365,293, "Methods for 
Display, Notification, and Interaction with Prioritized Messages" filed on July 30, 1999 

15 and assigned Serial No. 09/364,522, and "A Computational Architecture for Managing 
the Transmittal and Rendering of Information, Alerts, and Notifications" filed on July 30, 
1999 and assigned Serial No. 09/365,287. The above-noted applications are incorporated 
herein by reference. 

20 FIELD OF THE INVENTION 

This invention relates generally to prioritized text such as prioritized email 
messages, and more particularly to the routing of such prioritized text. 



BACKGROUND OF THE INVENTION 

25 Electronic mail programs have become a popular application among computer 

users. Especially with the advent of the Internet, exchanging email has almost become a 
reason why people purchase computers for personal reasons, and within many corporate 
environments, email has become the standard manner by which coworkers exchange 
information. However, with the increasing popularity of email, shortcomings have 

30 become apparent. 
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Chief among these shortcomings is that many users now face a deluge of email 
every day, such that the capability of being able to send and receive email has almost 
-become a hindrance to their day-to-day ability to get their job done, as opposed to being 
an asset. Some users report receiving over 100 email messages a day. With such large 
5 numbers of email, it is difficult to manage the email, such that the users read the most 
important messages first. 

Limited solutions to this problem have been attempted in the prior art. Prior art 
exists for attempting to curtail the amount of junk email - e.g., unsolicited email, typically 
regarding a service or product for sale - that users receive. Moreover, some electronic 
10 mail programs allow for the generation of rules that govern how an email is managed 
within the program - for example, placing all emails from certain coworkers in a special 
folder. 

These limited solutions, however, do not strike at the basic problem behind email 
- that with so much email being received, it would be most useful for a user to be able to 
1 5 have his or her computer automatically prioritize the email by importance or review 
urgency, and perform actions based on that prioritization. For these and other reasons, 
there is a need for the present invention. 

SUMMARY OF THE INVENTION 

20 The invention relates to the routing of prioritized text such as email messages. In 

one embodiment, a computer-implemented method first receives a text. The method 
generates a priority of the text, based on a text classifier such as a Bayesian classifier or a 
support-vector machine classifier. The method then routes the text based on a routing 
criteria. 

25 Embodiments of the invention provide for advantages over the prior art. A user, 

for example, in one embodiment, may ask that he or she only be disturbed if the priority 
of the text is greater than a given threshold. Thus, even if the user receives over 1 00 
different emails, he or she will be alerted to the most important email, and then will be 
able to deal with the other email when the user has time. Prioritization, in other words, 

30 makes email much more useful in environments where a lot of email is exchanged on a 
regular basis. 
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Embodiments of the invention include computer-implemented methods, 
computer-readable media, and computerized systems of varying embodiments. Still other 
.embodiments, advantages and aspects of the invention will become apparent by reading 
the following detailed description, and by reference to the drawings. 

5 

BRIEF DESCFUPTION OF THE DRAWINGS 

FIG- 1 is a diagram of an operating environment in conjunction with which 
embodiments of the invention may be practiced; 

FIG. 2 is a diagram showing explicit and implicit training of a text classifier, 
1 0 according to an embodiment of the invention; 

FIG. 3 is a diagram showing how a priority for a text is generated by input to a 
text classifier, according to an embodiment of the invention; 

FIG. 4(a) is a diagram of a scheme according to which the priority of a text can be 
classified, according to an embodiment of the invention; 
1 5 FIG. 4(b) is a diagram of another scheme according to which the priority of a text 

can be classified, according to another embodiment of the invention; 

FIG. 5(a) is a graph showing linear cost functions of high, medium and low 
priority texts, according to an embodiment of the invention; 

FIG. 5(b) is a graph showing a non-linear cost function for a text, according to an 
20 embodiment of the invention; 

FIG. 6 is a diagram showing classes of evidence that can be used to make an 

inference about a user's activity (e.g., if a user is present), according to one embodiment 

of the invention; 

FIG. 7 is a diagram showing a Bayesian network that can be used for inferring a 
25 user's activity (e.g., if a user is present), according to one embodiment of the invention; 

FIGS. 8-10 are influence diagrams showing how in one embodiment decision 
models can be utilized to make the decision as to how and when to alert a user to a 

message; 

FIG. 1 1 is a flowchart of a method according to an embodiment of the invention; 
30 FIG. 12 is a diagram of a system according to an embodiment of the invention; 
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FIG. 13 is a diagram of a system according to another embodiment of the 
•invention; and, 

FIGS. 14(a) and 14(b) are diagrams of a user interface via which routing criteria 
can be modified, according to an embodiment of the invention. 

DETAILED DESCRIPTION OF THE INVENTION 

In the following detailed description of exemplary embodiments of the invention, 
reference is made to the accompanying drawings which form a part hereof, and in which 
is shown by way of illustration specific exemplary embodiments in which the invention 
may be practiced. These embodiments are described in sufficient detail to enable those 
skilled in the art to practice the invention, and it is to be understood that other 
embodiments may be utilized and that logical, mechanical, electrical and other changes 
may be made without departing from the spirit or scope of the present invention. The 
following detailed description is, therefore, not to be taken in a limiting sense, and the 
scope of the present invention is defined only by the appended claims. 

Some portions of the detailed descriptions which follow are presented in terms of 
algorithms and symbolic representations of operations on data bits within a computer 
memory. These algorithmic descriptions and representations are the means used by those 
skilled in the data processing arts to most effectively convey the substance of their work 
to others skilled in the art. An algorithm is here, and generally, conceived to be a self-, 
consistent sequence of steps leading to a desired result. The steps are those requiring 
physical manipulations of physical quantities. Usually, though not necessarily, these 
quantities take the form of electrical or magnetic signals capable of being stored, 
transferred, combined, compared, and otherwise manipulated. 

It has proven convenient at times, principally for reasons of common usage, to 
refer to these signals as bits, values, elements, symbols, characters, terms, numbers, or the 
like. It should be borne in mind, however, that all of these and similar terms are to be 
associated with the appropriate physical quantities and are merely convenient labels 
applied to these quantities. Unless specifically stated otherwise as apparent from the 
following discussion, it is appreciated that throughout the present invention, discussions 
utilizing terms such as processing or computing or calculating or determining or 
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displaying or the like, refer to the action and processes of a computer system, or similar 
electronic computing device, that manipulates and transforms data represented as 
.physical (electronic) quantities within the computer system's registers and memories into 
other data similarly represented as physical quantities within the computer system's 
5 memories or registers or other such information storage, transmission or display devices. 
(It is noted that the terms document and text are used interchangeably herein and should 
be construed as interchangeable as well.) 

Operating Environment 

1 0 Referring to FIG. 1 , a diagram of the hardware and operating environment in 

conjunction with which embodiments of the invention may be practiced is shown. The 
description of FIG. 1 is intended to provide a brief, general description of suitable 
computer hardware and a suitable computing environment in conjunction with which the 
invention may be implemented. Although not required, the invention is described in the 

1 5 general context of computer-executable instructions, such as program modules, being 
executed by a computer, such as a personal computer. Generally, program modules 
include routines, programs, objects, components, data structures, etc., that perform 
particular tasks or implement particular abstract data types. 

Moreover, those skilled in the art will appreciate that the invention may be 

20 practiced with other computer system configurations, including hand-held devices, 

multiprocessor systems, microprocessor-based or programmable consumer electronics, 
network PC's, minicomputers, mainframe computers, and the like. The invention may 
also be practiced in distributed computing environments where tasks are performed by 
remote processing devices that are linked through a communications network. In a 

25 distributed computing environment, program modules may be located in both local arid 
remote memory storage devices. 

The exemplary hardware and operating environment of FIG. 1 for implementing 
the invention includes a general purpose computing device in the form of a computer 20, 
including a processing unit 21, a system memory 22, and a system bus 23 that operatively 

30 couples various system components including the system memory to the processing unit 
21. There may be only one or there may be more than one processing unit 2 1 , such that 
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the processor of computer 20 comprises a single central-processing unit (CPU), or a 
plurality of processing units, commonly referred to as a parallel processing environment. 
.The computer 20 may be a conventional computer, a distributed computer, or any other 
type of computer; the invention is not so limited. 
5 The system bus 23 may be any of several types of bus structures including a 

memory bus or memory controller, a peripheral bus, and a local bus using any of a 
variety of bus architectures. The system memory may also be referred to as simply the 
memory, and includes read only memory (ROM) 24 and random access memory (RAM) 
25. A basic input/output system (BIOS) 26, containing the basic routines that help to 

10 transfer information between elements within the computer 20, such as during start-up, is 
stored in ROM 24. The computer 20 further includes a hard disk drive 27 for reading 
from and writing to a hard disk, not shown, a magnetic disk drive 28 for reading from or 
writing to a removable magnetic disk 29, and an optical disk drive 30 for reading from or 
writing to a removable optical disk 31 such as a CD ROM or other optical media. 

1 5 The hard disk drive 27, magnetic disk drive 28, and optical disk drive 30 are 

connected to the system bus 23 by a hard disk drive interface 32, a magnetic disk drive 
interface 33, and an optical disk drive interface 34, respectively. The drives and their 
associated computer-readable media provide nonvolatile storage of computer-readable 
instructions, data structures, program modules and other data for the computer 20. It 

20 should be appreciated by those skilled in the art that any type of computer-readable media 
which can store data that is accessible by a computer, such as magnetic cassettes, flash 
memory cards, digital video disks, Bernoulli cartridges, random access memories 
(RAMs), read only memories (ROMs), and the like, may be used in the exemplary 
operating environment. 

25 A number of program modules may be stored on the hard disk, magnetic disk 29, 

optical disk 31, ROM 24, or RAM 25, including an operating system 35, one or more 
application programs 36, other program modules 37, and program data 38. A user may 
enter commands and information into the personal computer 20 through input devices 
such as a keyboard 40 and pointing device 42. Other input devices (not shown) may 

30 include a microphone, joystick, game pad, satellite dish, scanner, or the like. These and 
other input devices are often connected to the processing unit 21 through a serial port 
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interface 46 that is coupled to the system bus, but may be connected by other interfaces, 
'such as a parallel port, game port, or a universal serial bus (USB). A monitor 47 or other 
.type of display device is also connected to the system bus 23 via an interface, such as a 
'video adapter 48. In addition to the monitor, computers typically include other peripheral 
5 output devices (not shown), such as speakers and printers. 

The computer 20 may operate in a networked environment using logical 
connections to one or more remote computers, such as remote computer 49. These 
logical connections are achieved by a communication device coupled to or a part of the 
computer 20; the invention is not limited to a particular type of communications device. 
y 1 0 The remote computer 49 may be another computer, a server, a router, a network PC, a 
client, a peer device or other common network node, and typically includes many or all 
of the elements described above relative to the computer 20, although only a memory 
storage device 50 has been illustrated in FIG. 1 . The logical connections depicted in FIG. 
1 include a local-area network (LAN) 51 and a wide-area network (WAN) 52. Such 
1 5 networking environments are commonplace in office networks, enterprise-wide computer 
networks, intranets and the Internet, which are all types of networks. 

When used in a LAN-networking environment, the computer 20 is connected to 
the local network 5 1 through a network interface or adapter 53, which is one type of 
communications device. When used in a WAN-networking environment, the computer 
20 20 typically includes a modem 54, a type of communications device, or any other type of 
communications device for establishing communications over the wide area network 52, 
such as the Internet. The modem 54, which may be internal or external, is connected to 
the system bus 23 via the serial port interface 46. In a networked environment, program 
modules depicted relative to the personal computer 20, or portions thereof, may be stored 
25 in the remote memory storage device. It is appreciated that the network connections 

shown are exemplary and other means of and communications devices for establishing a 
communications link between the computers may be used. 

Generating a Priority for a Text 
30 In this section of the detailed description, the generation of a priority for a text 

such as an email, according to one embodiment of the invention, is described. The 
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generation of priorities for texts as described can then be used in methods, systems, and 
-computer-readable media (as well as other embodiments) of the invention as are 
.presented in other sections of the detailed description. The description in this section is 
"provided in conjunction with FIG. 2 and FIG. 3, the former which is a diagram showing 
5 explicit and implicit training of a text classifier, according to an embodiment of the 

invention, and the latter which is a diagram showing how a priority for a text is generated 
by input to a text classifier, according to an embodiment of the invention. The 
description is also provided in conjunction with FIGS. 4(a) and 4(b), which are diagrams 
of different schema according to which the priority of a text can be classified, and in 

10 conjunction with FIGS. 5(a) and 5(b), which are graphs showing different cost functions 
that may be applicable depending on text type. 

Referring first to FIG. 2, the text classifier 200 is able to be trained both 
explicitly, as represented by the arrow 202, and implicitly, as represent by the arrow 204. 
The explicit training represented by the arrow 202 is usually conducted at the initial 

1 5 phases of constructing the text classifier 200, while the implicit training represented by 
the arrow 204 is usually conducted after the text classifier 200 has been constructed, to 
fine tune the classifier 200. However, the invention is not so limited. 

The text classifier 200 in one embodiment is a Bayesian classifier, as known 
within the art, while in another embodiment it is a support vector machine (SVM) 

20 classifier, as also known within the art. Text classification methodology based on a 
Bayesian learning approach is specifically described in the reference M. Sahami, S. 
Dumais, D. Heckerman, E. Horvitz, A Bayesian Approach to Junk E-Mail Filtering, 
AAA! Workshop on Text Classification, July 1998, Madison, Wisconsin, AAA1 
Technical Report WS-98-05, which is hereby incorporated by reference. Text 

25 classification methodology based on an SVM approach is specifically described in the 
following references: the coassigned patent, U.S. Pat. No. 5,864,848, issued January 26, 
1999, which is hereby incorporated by reference; the previously filed and coassigned case 
entitled "Methods and Apparatus for Building a Support Vector Machine Classifier," 
serial no. 09/055,477, filed on April 6, 1 998, which is also hereby incorporated by 

30 reference; and, the reference J. Piatt, Fast Training of Support Vector Machines using 

Sequential Minimal Optimization, MIT Press, Baltimore, MD, 1998, which is also hereby 
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incorporated by reference. For purposes of this application, specific description is made 
-with reference to an SVM classifier, although those of ordinary skill within the art can 
^appreciate that the invention is not so limited. 

8 As shown in FIG. 2, the explicit training of the text classifier 200 as represented 

5 by the arrow 202 includes constructing the classifier in 206, including utilizing feature 
selection. In general, Support Vector Machines build classifiers by identifying a 
hyperplane that separates a set of positive and negative examples with a maximum 
margin. In the linear form of SVM that is employed in one embodiment, the margin is 
defined by the distance of the hyperplane to the nearest positive and negative cases for 

10 each class. Maximizing the margin can be expressed as an optimization problem. A 
post-processing procedure described in the Piatt reference is used that employs 
regularized maximum likelihood fitting to produce estimations of posterior probabilities. 
The method fits a sigmoid to the score that is output by the SVM classifier. 

In the explicit training, the text classifier is presented with both time-critical and 

1 5 non-time-critical texts (eg., email messages), so that it may be able to discriminate 

between the two. This training set may be provided by the user, or a standard training set 
may be used. Given a training corpus, the text classifier first applies feature-selection 
procedures that attempt to find the most discriminatory features. This process employs a 
mutual-information analysis. Feature selection can operate on single words or higher 

20 level distinctions made available, such as phrases and parts of speech tagged with natural 
language processing - that is, the text classifier 200 is able to be seeded with specially 
tagged text to discriminate features of a text that are considered important. 

Feature selection for text classification typically performs a search over single 
words. Beyond the reliance on single words, domain-specific phrases and high-level 

25 patterns of features are also made available. Special tokens can also enhance 

classification. The quality of the learned classifiers for email criticality can be enhanced 
by inputting to the feature selection procedures handcrafted features that are identified as 
being useful for distinguishing among email of different time criticality. Thus, during 
feature selection, single words as well as special phrases and symbols that are useful for 

30 discriminating among messages of different levels of time criticality are considered. 
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Tokens and patterns of value in identifying the criticality of messages include 
-such distinctions as (including Boolean combinations thereof): 



5 To: field 

Addressed just to user 

Addressed to only a few people including user 
Addressed to an alias with a small number of people 
Addressed to several aliases with a small number of people 
10 Cc:'dtouser 
Bcc:'dtouser 

People 

Names on pre-determined list of important people 

15 

Family members 
People at company 
Organization chart structure 

Managers 1 report to 
20 Managers of managers of people 1 report to 

People who report to me 
External business people 

Past tense 

25 These include descriptions about events that have already occurred such as: 

We met 
meeting went 
happened 
30 got together 
took care of 
meeting yesterday 

Future tense 

35 Tomorrow 

This week - 

Are you going to 
When can we 

40 Meeting and coordination 

Get together 
Can you meet 
Will get together 
Coordinate with 
45 Need to get together 

Resolved dates 

Dates indicated from text and msg. time (e.g., tomorrow, send yesterday) 

50 Questions 

Word + ? 

Indications of personal requests: 
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Can you 
T Are you 
'Will you 
you please 
5 -Can you do 

Indications of need: 

1 need 
He needs 
1 0 She needs 
I'd like 

It would be great 
I want 
He wants 
1 5 She wants 

Take care of 

Time criticaljry 

happening soon 
20 right away 

deadline will be 

deadline is 

as soon as possible 

needs this soon 
25 to be done soon 

done right away 

Importance 

is important 
30 is critical 

Word + ! ' 
Explicit priority flag status (low, none, high) 

Length of message 
35 Number of bytes in component of new message 

Signs of Commercial and Adult-Content Junk email 

Free!! 

!!! ' " '* "* " ' " " " ' '"" "-' ' '. ' <" 

40 Under 18 
Adult 

Percent caps 

Percent nonalphanumeric characters 
etc. 

45 

Other features that may be used for feature selection are described in the cofiled, 
copending and coassigned application entitled "A Computational Architecture for 
Managing the Transmittal and Rendering of Information, Alerts, and Notifications" filed 
on July 30, 1999 and assigned Serial No. 09/365,287, which is hereby incorporated by 
50 reference, and in the copending and coassigned application entitled "Methods and 
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Apparatus for Building a Support Vector Machine Classifier," serial no. 09/055,477, filed 
on April 6, 1998, which has already been incorporated by reference. 

Furthermore, still referring to FIG. 2, implicit training of the text classifier 200, as 
represented by the arrow 204, can be conducted by continually watching the user work in 
5 210. The assumption is that as users work, and lists of mail are reviewed, time-critical 
messages are read first, and low-priority messages are reviewed later, or just deleted. 
That is, when presented with a new email, the user is watched to determine whether she 
immediately opens the email, and in what order (if more than one new email are present), 
deletes the email without opening, and/or replies to the email right away. Thus, the text 

10 classifier is such that a user is continually watched while working, and the classifier is 
continually refined by training in the background and being updated in real time for 
decision making. For each message inputted into the classifier, a new case for the 
classifier is created. The cases are stored as negative and positive examples of texts that 
are either high or low priority. 

1 5 Referring next to FIG. 3, a text, such as an email message, 300 is input into the 

text classifier 200, which based thereon generates a priority 302 for the text 300. That is, 
in one embodiment, the text classifier 200 generates a priority 302, measured as a 
percentage from 0 to 1 (i.e., 0% to 1 00%). This percentage is a measure of the likelihood 
that the text 300 is of high priority, based on the previous training of the classifier 200. 

20 The priority in one embodiment is a measure of the rate of cost accrued with delayed 
review of the document. 

In addition, the priority can be based on the case where the user has not yet 

reviewed a document already. In such an instance, as described in more detail in the 
related cases already incorporated by reference, this can be based on 

25 NEVA- EVTA - ECA - TC 

where NEVA is the net expected value of alerting, EVTA is the expected value of 
alerting, ECA is the expected cost of alerting, and TC is the transmission cost. Thus, 
alerting can be made when the NEVA exceeds a predetermined threshold. For example, 
if size of message is important, then a bin-packing approximation, as known within the 

30 art, can be employed, to attempt to prioritize transfers by NEVA divided by the message 
size in bytes to guide the transmission ordering. Alternatively, a threshold on the 
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traction, above which transfers are made and below which transfers are suppressed, can 
^also be utilized. 

It is noted that as has been described, the text classifier and the priority generated 
thereby is based on a scheme where each email in the training phase is construed as either 
5 high priority or low priority, such that the priority generated by the text classifier is a 
measure of the likelihood of the text being analyzed is of high priority. This scheme is 
specifically shown by reference to FIG. 4(a), where the text classifier 200 is trained by a 
group of texts 400 that are high priority and a group of texts 402 that are low priority, 
such that a text to be analyzed 400 is input into the classifier 200, which outputs a scalar 
10 number 406 measuring the likelihood that the text being analyzed is of high priority. 

However, those of ordinary skill within the art can appreciate that the invention is not so 
limited. 

For example, referring to FIG. 4(b), a diagram showing a scheme where texts are 
divided into low, medium and high priority, according to an embodiment of the 

1 5 invention, is shown. The text classifier 200 in the embodiment of FIG. 4(b) is trained by 
a group of texts 400 that are high priority and a group of texts 402 that are low priority, as 
in the previous embodiment, but also by a group of texts 450 that are medium priority. 
Thus, a text to be analyzed 400 is input into the classifier 200, which outputs a scalar 
number 406, that can measure the likelihood that the text being analyzed is of high 

20 priority, if so desired, or medium priority or low priority. The classifier 200 is also able 
to output a class 452, which indicates the class of low, medium or high priority that the 

text 4Q4 most likely falls into. Those of ordinary skill within the art can appreciate that 

further classes can also be added if desired. 

The invention is not limited to the definition of priority as this term is used by the 

25 text classifier to assign such priority to a text such as an email message. In one 

embodiment, however, priority is defined in terms of a loss function. More specifically, 
priority is defined in terms of the expected cost in lost opportunities per time delayed in 
reviewing the text after it has be received - that is, the expected lost or cost that will result 
for delayed processing of the text. This loss function can further vary according to the 

30 type of text received. 
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For example, the general case is shown in FIG. 5(a), which is a graph of linear 
r cost functions dependent on the priority of a text. In the graph 500, as time increases, the 
-cost of not having reviewed a text also increases. However, the cost increases more for a 
high priority message, as indicated by the line 502, as compared to a medium priority 
5 message, as indicated by the line 504, or a low priority message, as indicated by the line 
506. That is, the high priority line 502 may have a slope of 100, the medium priority line 
504 may have a slope of 10, and the low priority line 502 may have a slope of 1 . These 
slope values can then be used by the text classifier to assist in assigning a priority to a 
given text, for example, by regression analysis. 

1 0 Some messages, however, do not have their priorities well approximated by the 

use of a linear cost function. For example, a message relating to a meeting will have its 
cost function increase as the time of the meeting nears, and thereafter, the cost function 
rapidly decreases - since after the meeting is missed, there is not much generally a user 
can do about it. This situation is better approximated by a non-linear cost function, as 

15 shown in FIG. 5(b). In the graph 550, the cost function 554 rapidly increases until it 
reaches the time of the meeting demarcated by the line 552, after which it rapidly 
decreases. Thus, those of ordinary skill within the art can appreciate that depending on a 
message's type, the cost function can be approximated by one of many different 
representative cost functions, both linear and non-linear. 

20 Thus, as has been described, the priority of a text can be just the likelihood that it 

is of high priority based on output of a text classifier, or the most likely priority class - 
(i.e., medium, low or high priority) it falls into, also based on the output of the text 
classifier. However, in another embodiment of the invention, an expected time criticality 
of each text, such as an email message, is determined. This can be written as 

25 el = Yj P( critical i ^{critical, ) 

where EL is the expected loss, p(critical,) is the probability that a text has the criticality i 
(e.g., where i=0 may be low priority and HI may be high priority, or where i=0 may be 
low priority, i=I medium priority and i=2 high priority, etc.), C(critical,) is the cost 
function for text having the criticality i, and n is the total number of criticality classes 
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minus one. The cost functions may be linear or non-linear, as has been described - in the 
-case where the function are linear, the cost function is thus the rate of loss. 

In the case where n=l, specifying that there are only two priority classes low and 
high, the expected loss can be reformulated as 
5 EC = p(critical high )C(critical high ) + [l - p{critical }ow )]C\critical h J 

where EC is the expected criticality of a text. Furthermore, if the cost function of low 
criticality messages is set to zero, this becomes 

EC = p(cn//ra/ ///e/ ,)c(c/77/ca/ / ,^) 

The total loss until the time of review of a text can be expressed as the integration of the 
10 expressed criticality, or, 

el = I p( critica h,, g ,> ) 

•• 1 
where t is the time ot review. 

Determining When to Alert the User 

1 5 In this section of the detailed description, described is provided as to determining 

when to alert the user of a high-priority text, for example, a text that has a likelihood of 
being high priority greater than a user-set threshold, or greater than a threshold 
determined by decision theoretic reasoning. That is, beyond knowing about time critical 
messages, it is also important in one embodiment to decide when to alert a user to time 

20 critical messages if the user is not directly viewing incoming email (in one embodiment). 
In the general case, a cost of distracting the user from the current task being addressed to 
learn about the time-critical message is determined. 

In general, a user should be alerted when a cost-benefit analysis suggests that the 
expected loss the user would incur in not reviewing the message at time t is greater than 

25 the expected cost of alerting the user. That is, alerting should be conducted if 

EL-EC>Q 

where EL is the expected loss of non-review of the text at a current time t, and EC is the 
expected cost of alerting the user of the text at the current time t. The expected loss is as 
described in the previous section of the detailed description. 
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However, this formulation is not entirely accurate, because the user is assumed to 
review the message on his or her own at some point in the future anyway. Therefore, in 
-actuality, the user should be alerted when the expected value of alerting, referred to as 
ECA, is positive. The expected value of alerting should thus consider the value of 
5 alerting the user of the text now, as opposed to the value of the user reviewing the 
message later on his or her own, without alert, minus the cost of alerting. This can be 
stated as 

EVA- EL alen - EL ml _ alerl EC 

where EL d \ m is the expected loss of the user reviewing the message if he or she were to 
1 0 review the message now, upon being alerted, as opposed to EL n0 -aiert, which is the 

expected loss of the user reviewing the message on his or her own at some point, without 

being alerted, minus EC, the expected cost of alerting (now). 

Furthermore, in one specific embodiment of the invention, information from 

several messages is grouped together into a single compound alert. Reviewing 
1 5 information about multiple messages in an alert can be more costly than an alert relaying 

information about a single message. Such increases in distraction can be represented by 

making the cost of an alert a function of its informational complexity. It is assumed that 

the EVA of an email message is independent of the EVA of the other email messages. 

EVA(Mj,/) is used to refer to the value of alerting a user about a single message A/, at 
20 time t and EGA(;?) is used to refer to the expected cost of relaying the content of n 

messages. Thus, multiple messages can be considered by summing together the expected 

value of relaying information about a set of n messages, : • 

-NEVA = '£ d EVA(M n t)- ECA(n). 

25 In one embodiment of the invention, it is noted that determining when to alert the 

user is conducted in accordance with the more rigorous treatment of EVA described in 
the copending, cotiled and coassigned application entitled "A Computational 
Architecture for Managing the Transmittal and Rendering of Information, Alerts, and 
Notifications" filed on July 30, 1999 and assigned Serial No. 09/365,287, which is hereby 

30 incorporated by reference. However, the invention is not so limited. 
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It is also noted that in order to determine the expected cost of alerting, it is useful 
io infer or directly access information about whether the user is present - and therefore 
.can see or hear alerts from the computer - or is not present. Sensors can be used in one 
embodiment that indicate when a user is in the office, such as infrared sensors, pressure 
5 sensors (on the chair), etc. However, if such devices are not available, a probability that a 
user is in the office can be assigned as a function of user activity on the computer, such as 
the time since last observed mouse or keyboard activity. Furthermore, scheduling 
information available in a calendar can also be made use of to make inferences about the 
distance and disposition of a user, to consider the costs of forwarding messages to the 

10 user by different means (e.g., cell phone, pager, etc.). 

It is also important to know how busy the user is in making decisions about 
interrupting the user with information about messages with high time criticality. In one 
embodiment, it is reasoned about whether and the rate at which a user is working on a 
computer, or whether the user is on the telephone, speaking with someone, or at a 

1 5 meeting at another location. In one embodiment, several classes of evidence can be used 
to assess a user's activity or his or her focus of attention, as shown in FIG. 6. A Bayesian 
network, as known in the art, can then be used for performing an inference about a user's 
activity; an example of such a network is shown in FIG. 7. Utilizing evidence to infer 
whether the user is present is described more rigorously in the cofiled, copending and 

20 coassigned application entitled "A Computational Architecture for Managing the 

Transmittal and Rendering of Information, Alerts, and Notifications" filed on July 30, 

and assigned Serial No. 09/365,287, which has already been incorporated by 
reference (specifically, with respect to determining one or more probabilities). Thus, in 
one embodiment, a probability inference as to whether a user is present is determined in 

25 accordance with the description provided in this application. 

In genera], a decision should be made as to when and how to alert users to 
messages and to provide services (for example) based on the inference of expected 
criticality and user activity. In one embodiment, this decision is made as described in the 
cofiled, copending and coassigned application entitled "A Computational Architecture for 

30 Managing the Transmittal and Rendering of Information, Alerts, and Notifications" filed 
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on July 30, 1999 and assigned Serial No. 09/365,287, which has already been 
-incorporated by reference. 

In another embodiment, this decision is made by utilizing decision models, as 
known within the art. FIGs. 8-10 are influence diagrams, as known within the art, 
5 showing how in one specific embodiment such decision models can be utilized to make 
this decision. Specifically, FIG. 8 displays a decision model for decisions about 
interrupting a user, considering current activity, expected time criticality of messages, 
and cost of alerting depending on the modality. FIG. 9 also includes variables 
representing the current location and the influence of that variable on activity and cost of 

10 the alternate messaging techniques. Finally, FIG. 10 is further expanded to consider the 
costs associated with losses in fidelity when a message with significant graphics content 
is forwarded to a user without the graphical content being present. 

In still another embodiment, the decision as to when and how to alert users is 
made by employment of a set of user-specified thresholds and parameters defining 

1 5 policies on alerting. In this embodiment, user presence can be inferred based on mouse 
or keyboard activity. Thus, a user can be allowed to input distinct thresholds on alerting 
for inferred states of activity and nonactivity. Users can input an amount of idle activity 
following activity where alerting will occur at lower criticalities. In this embodiment, if 
it is determined that the user is not available based on the time that no computer activity 

20 is seen - or on the user's inactivity when an attempt to alert is made - then messages are 
stored, and are reported to the user in order of criticality when the user returns to interact 
with the computer (or, returns to the room, given the availability of inputs from infrared 
or other presence detection). 

Furthermore, in this embodiment, users can specify routing and paging options (as 

25 well as other output options) as a function of quantities including expected criticality, 
maximum expected loss, and value of alerting the user. Such routing, paging and other 
output options are more specifically described in the copending, cofiled, and coassigned 
applications entitled 'Integration of a Computer-Based Message Priority System with 
Mobile Electronic Devices" filed on July 30, 1999 and assigned Serial No. 09/365,293, 

30 and "Methods for Display, Notification, and Interaction with Prioritized Messages" filed 
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on July 30, 1999 and assigned Serial No. 09/364,522, which are all hereby incorporated 
"by reference. The invention is not so limited, however. 

Method and System 

5 In this section of the detailed description, a computer-implemented method 

according to an embodiment of the invention is described, and a computerized system 
according to an embodiment of the invention is described. With respect to the method, 
the method is desirably realized at least in part as one or more programs running on a 
computer — that is, as a program executed from a computer-readable medium such as a 

1 0 memory by a processor of a computer. The program is desirably storable on a machine- 
readable medium such as a floppy disk or a CD-ROM, for distribution and installation 
and execution on another computer. 

Referring to FIG. 1 1 , a flowchart of a method according to an embodiment of the 
invention is shown. In 900, a text to have a priority thereof assigned is received. The 

1 5 text can be an email message, or any other type of text; the invention is not so limited. In 
902, the priority of the text is generated, based on a text classifier, as has been described. 
Thus, in one embodiment, 902 includes initially training and continually training the text 
classifier, as has been described. 

The priority of the text is then output in 904. In one embodiment, as indicated in 

20 FIG. 1 1, this can include 906, 908, 910, 912 and 914; however, the invention is not so 
limited. In 906, an expected loss of non-review of the text at a current time / is 
determined - in one embodiment, by also considering the expected loss of non-review of 
■ the text at a future time, based on the assumption that ultimately the user will review the 
text him or herself, without being alerted, as has been described. In 908, an expected cost 

25 of alerting is determined, as has also been described. If the loss is greater than the cost in 
910, then no alert is made at the time /, and the method proceeds back to 906, to 
redetermine the cost-benefit analysis, at a new current time /. This is done because as 
time progresses, the expected loss may at some point outweigh the alert cost, such that 
the calculus in 910 changes. Upon the expected loss outweighing the alert cost, then an 

30 , alert to the user is performed in 914, as has been described. 
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In one embodiment, the output of the alert is performed as is now deseribed. The 
alert is performed by routing the prioritized text based on a routing criteria. Routing 
criteria that can be used in conjunction with embodiments of the invention is not limited 
"by the invention; however, in one embodiment, the routing criteria is as described in a 
5 further section of the detailed description. Routing of the text can include forwarding the 
text, or replying to the sender of the text, in the case where the text is email. 

Referring next to FIG. 12, a diagram of a system according to an embodiment of 
the invention is shown. The system includes a program 950 and a text classifier 952. 
Each of the program 950 and the classifier 952 include a computer program executed by 
10 a processor of a computer from a computer-readable medium thereof, in one 
embodiment. However, the invention is not so limited. 

The program 950 generates a text for input into the text classifier 952. In one 
embodiment, the program includes an electronic mail program that receives email, which 
then serve as the text. The text classifier 952, based on the text, generates a priority 
1 5 thereof, as has been described. In one embodiment, the text classifier 952 is a Bayesian 
text classifier, while in another embodiment, it is a Support Vector Machine classifier. 
The priority of the text output by the text classifier 952 can then be used in further 
conjunction with a cost-benefit analysis, as has been described, to effectuate further 
output and/or alerting based thereon, as has been described. The invention is not so 
20 limited, however. 

Referring next to FIG. 13, a diagram of a system according to another 

: embodiment of the invention is shown. The system of FIG. 1 3 incl udes an additional 

component, a routing mechanism 970. Not shown in FIG. 1 3 are the program 950 and 
the text classifier 952; however, the routing mechanism 970 is operatively and/or 
25 communicatively coupled to the latter. In one embodiment, the mechanism 970 includes 
a computer program executed by a processor of a computer from a computer-readable 
medium thereof, but the invention is not so limited. 

The routing mechanism 970, as shown in FIG. 13, receives a prioritized text, and 
based on a routing criteria, is able to reply to the sender of the text (in the case where the 
30 text is an email message), in which case the mechanism is a replying mechanism. Also 
based on the routing criteria, the mechanism 970 is able to forward the text, for example, 
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to a different email address, in which case the mechanism is a forwarding mechanism. 
The former may be useful when the user wishes to indicate to the sender of a message 
that the user is not present, and thus may provide the sender with contact information as 
"to how to reach the user. The latter may be useful when the user has email access to a 
5 different email address, such as a web-based email address as known in the art (examples 
include Microsoft Corp.'s HotMail at http://www.hotmail.com ), such that the user wishes 
to be kept informed of high priority emails at this alternative address. The invention is 
not limited as to a particular routing criteria, although in one embodiment, the routing 
criteria is as described in the next section of the detailed description. 

10 

Routing Criteria 

In this section of the detailed description, a routing criteria according to one 
embodiment of the invention is described. The routing criteria is the criteria that governs 
when and how a user is alerted to a prioritized text by having the text routed - for 

1 5 example, forwarded to another email address, or such that the sender of the original text 
is replied to. The routing criteria is described with reference to FIGs. 14(a) and 14(b), 
which are diagrams of a user interface via which routing criteria options can be modified, 
according to one embodiment of the invention. 

Referring first to FIG. 14(a), two tabs are selectable, a forward tab 990 and a 

20 custom reply tab 992. In FIG. 14(a), however, the forward tab 990 is specifically 

selected, such that routing criteria with respect to forwarding a prioritized text such as an 
email message are shown. The user is able to specify an alternative email address to 
. which highTpriority emails are forwarded. More specifically, emails are forwarded to the 
address if the user has been away from the computer more than a predetermined amount 

25 of time, and a particular email to be forwarded has a priority greater than a predetermined 
threshold. For example, as shown in FIG. 14(a), the predetermined threshold is a priority 
of 95, and the predetermined amount of time is 600 minutes. Thus, if it is determined 
that the priority of an email message is greater than 95, and that the user has been away 
from the computer for more than 600 minutes, then the email will be forwarded to the 

30 specified email address. 



21 



MS127735.2 



Referring next to FIG. 14(b), the same two selectable tabs are shown, the forward 
•tab 990 and the custom reply tab 992. However, in FIG. 14(b), the custom reply tab 992 
.is specifically selected, such that routing criteria with respect to replying to the sender of 
a prioritized text such as an email message are shown. The user is able to specify a 
5 predetermined message that will be sent in the reply to the sender of the high-priority 
email message. Emails are replied to if the user has been away from the computer more 
than a predetermined amount of time, and a particular email to be replied to has a priority 
greater than a predetermined threshold. For example, as shown in FIG. 14(b), the 
predetermined threshold is a priority of 95, and the predetermined amount of time is 120 
10 minutes. Thus, if it is determined that the priority of an email message is greater than 95, 
and that the user has been away from the computer for more than 1 20 minutes, then the 
sender of the email will be replied to with the specified predetermined message. 

Conclusion 

1 5 Although specific embodiments have been illustrated and described herein, it will 

be appreciated by those of ordinary skill in the art that any arrangement which is 
calculated to achieve the same purpose may be substituted for the specific embodiments 
shown. This application is intended to cover any adaptations or variations of the present 
invention. Therefore, it is manifestly intended that this invention be limited only by the 

20 following claims and equivalents thereof. 
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